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Summary 


Problem;  Allow  epidemiologists  to  examine  ways  to  obtain  and  assess  clinical  data  as 
quickly  and  accurately  as  possible  while  in  a  combat  setting.  Epidemiologists  and 
preventive  medicine  specialists  need  ways  to  relate  patient  signs  and  symptoms  data 
collected  at  the  time  of  presentation  into  associated  categories  to  help  in  the  early 
detection  of  any  adverse  health  event. 

Objective:  To  develop  a  computerized  algorithm  that  would  accept  sequentially  in  real 
time  patient  symptom  data  (vectors)  and  dynamically  identify  patterns  of  probable 
syndromes  or  diseases  so  timely  preventive  measures  can  be  taken. 

Approach:  A  simple  Adaptive  Resonance  Theory  (ART)  algorithm  was  developed  that 
was  capable  of  sequentially  accepting  patient  symptom  vectors  and  dynamically 
clustering  them  into  patterns  or  syndromes  described  by  prototype  symptom  vectors.  A 
modified  ART  algorithm  was  also  proposed  to  deal  with  syndromes  that  do  not  possess  a 
strong  core  of  symptoms. 

Results:  The  basic  ART  algorithm  was  demonstrated  with  simulated  symptom  data  from 
300  simulated  patients  representing  10  underlying  syndromes.  The  algorithm  was  shown 
to  correctly  identify  the  syndromes  and  to  cluster  the  patients  into  their  correct 
syndromes. 

Conclusions:  Based  on  the  results  of  this  simulation,  ART  algorithms  appear  to 
accomplish  the  stated  objective  for  simulated  data.  The  basic  ART  algorithm  is  simple, 
easily  implemented,  and  capable  of  running  in  real  time  on  modest  computing  resources 
(e.g.,  a  laptop  PC).  Future  work  on  these  algorithms  should  include  testing  on  real  field 
data  and  experimentation  to  determine  optimum  settings  for  the  algorithm  parameters. 
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1.  INTRODUCTION 


The  health  and  well-being  of  groups  of  U.S.  service  personnel  assigned  to  duty  in 
foreign  countries  is  of  vital  importance  to  the  successful  completion  of  the  intended 
missions.  Upon  relocation,  such  groups  represent  nonindigenous  populations  exposed  to 
the  endemic  diseases  of  the  region  and  other  hazards  associated  with  the  assignment. 
Within  these  groups,  outbreaks  of  debilitating  diseases/syndromes  can  render  many 
individuals  unable  to  perform  their  duties,  placing  a  severe  strain  on  medical  resources, 
jeopardizing  their  missions,  and  possibly  endangering  the  lives  of  those  stricken. 

Field  medical  personnel  need  a  way  to  continuously  monitor  the  population  and  to 
quickly  detect  patterns  of  affliction.  Since  the  number  of  medical  doctors  available  in  the 
field  is  usually  very  limited,  this  is  of  particular  importance  to  corpsman  who  do  not  have 
the  training  or  resources  to  make  a  definitive  diagnosis  in  the  field.  Typically,  a  sick 
patient  is  examined  in  the  field  by  a  corpsman.  The  corpsman  records  signs  and 
symptoms  for  the  patient  and  enters  them  into  a  database.  The  definitive  diagnosis  and 
etiology  of  the  disease/syndrome,  however,  are  not  determined  until  much  later  when  the 
patient  is  transferred  to  a  field  hospital  or  back  to  the  home  base  where  doctors  can 
perform  diagnostics  and  laboratory  analyses.  One  consequence  of  this  delay  is  that  as 
patients  enter  this  system  over  time,  clusters  of  patients  afflicted  with  the  same 
disease/syndrome  can  form  and  go  imdetected  until  there  is  a  high  toll. 

What  is  needed  is  a  method  of  examining  the  sign-symptom  profiles  in  near  real 
time,  as  well  as  detecting  clusters  of  affliction,  which  might  be  called  “syndromes,”  and 
their  associated  descriptions  so  that  preventive  measures  can  be  instituted  before  the  loss 
of  productive  personnel  can  drain  resources.  For  example,  a  cluster  of  patients  with 
severe  gastroenteritis  will  lead  authorities  to  examine  food,  water,  and  sanitary  conditions 
first  as  obvious  culprits  in  an  attempt  to  locate  the  source  and  limit  the  number  of  new 
cases.  Note  that  the  emphasis  here  is  on  timely  detection  and  prevention;  the  first  priority 
in  the  field  is  to  stop  the  progression  rather  than  an  definitive  diagnosis  of  the  disease  or 
determination  of  its  etiology. 
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1.  STUDY  OF  APPROACHES 


The  problem  of  aggregating  sign-symptom  data  falls  into  the  general  theme  of 
classification,  clustering,  and  pattern  recognition.  There  is  a  plethora  of  medical  research 
in  which  classification,  clustering,  and  pattern  recognition  methods  have  been  useful* 
Two  examples  are  the  work  of  Ciampi  et.al,  and  Knstensen.  Unfortunately,  the 
problem  of  interest  in  this  study  does  not  fit  neatly  into  any  of  the  classical  paradigms  for 
which  analytical  solutions  are  available  (See,  for  example,  Nadler  and  Smith,  and 
Fuktmaga^  for  mathematical  descriptions  of  these  classical  paradigms).  Generally, 
algorithms  for  classification,  clustering,  and  pattern  recognition  involve  either  supervised 
or  unsupervised  learning,  and  the  number  of  clusters  present  is  assumed  known  in 
advance. 

In  supervised  learning,  a  large  representative  training  sample  of  individual  sign- 
symptom  vectors  with  associated  cluster  information  (e.g.,  definitive  diagnosis)  are  used 
to  "train"  a  complex  algorithm  to  recognize  the  patterns  associated  with  each  cluster. 
Here,  "training"  usually  entails  repeatedly  presenting  the  training  data  to  the  algorithm 
while  continuously  adjusting  a  number  of  parameters  until  a  desired  classification 
accuracy  is  achieved.  Variations  on  this  include  the  use  of  cross-validation  (holding 
certain  individuals  out  of  the  training  sample  for  measuring  accuracy).  Computerized 
disease  diagnosis  algorithms  are  available  and  in  use  today,  some  of  which  may  be 
purchased  at  local  software  stores  and  run  on  personal  computers  equipped  with  CD 
ROM  (compact  disk,  read  only  memory).  Most  conduct  an  interview  with  the  patient  (or 
attending  medical  person),  asking  questions  until  a  certainty  threshold  for  diagnosis  is 
exceeded.  The  success  of  these  systems  is  completely  determined  by  the  reliability, 
accuracy,  and  size  of  the  training  database. 


*  A  comprehensive  bibliography  on  pattern  recognition  and  cluster  analysis  is  included  in 
the  reference  section  of  this  report.  For  more  information,  please  see  references  1-9,  12, 
13-17,  and  21. 
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A  variation  on  supervised  learning  makes  use  of  expert  opinion  in  place  of  data. 
Intriguing  work  in  this  area  includes  that  of  Saaty/*’  who  developed  a  mathematical 
technique  for  making  decisions  based  on  inconsistent  expert  opinions.  In  the  problem  of 
interest  to  this  study,  usually  insufficient  or  no  training  data  are  available.  This  is  true 
because  the  symptomology  associated  with  endemic  diseases  may  be  well  understood  for 
an  indigenous  population,  but  not  so  with  a  nonindigenous  population.  Also,  the  umque 
circumstances  associated  with  any  military  mission  can  lead  to  unprecedented  epidemics 
and  conditions.  These  considerations  rule  out  the  use  of  supervised  leammg. 

In  unsupervised  learning,  the  algorithm  is  designed  to  form  clusters  based  on 
patterns  existing  in  a  large  set  of  data  that  have  been  collected.  Unlike  supervised 
learning,  the  data  provided  include  the  patterns  but  none  contain  the  true  classification  or 
cluster  membership  information.  One  successful  algorithm  is  the  "K-means"  algorithm  as 
described  in  Schalkoff.^°  Implicit  in  this  and  other  algorithms  that  use  unsupervised 
learning  is  the  assumption  that  K  (known)  clusters  exist.  In  the  problem  addressed  by 
this  study,  the  number  of  clusters  is  unknown  in  advance,  and  the  data  become  available 
sequentially  (as  opposed  to  being  supplied  in  one  large  complete  group).  It  is  possible  to 
guess  an  upper  bound  on  the  number  of  clusters.  For  example,  one  could  take  the  number 
of  incapacitating  endemic  diseases  for  the  region  and  add  one  or  two  to  allow  for  the 
possibility  of  the  emergence  of  a  new  disease  or  syndrome.  Still,  there  is  no  way  to  know 
which,  and  how  many,  clusters  will  become  active  until  data  are  entered. 
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3.  DYNAMIC  AGGREGATION  ALGORITHM 

What  is  needed  for  the  problem  faced  in  this  investigation  is  a  real-time 
unsupervised  learning  system  than  can  adapt  to  the  sign-symptom  data  as  they  are 
collected.  Adaptive  Resonance  Theory  (ART)  gives  several  prototypes  of  such  a  system 
(See  Hertz  et.al.^°  for  a  discussion  and  further  references  to  ART).  We  have  finished 
coding  an  ART  algorithm  (later  referred  to  as  “ARTl”)  that  seems  to  satisfy  the  need  for 
the  algorithm  to  adapt.  To  describe  it,  some  assumptions  and  special  notations  are 
needed.  First,  it  is  assumed  that  each  patient  can  be  represented  by  a  binary  vector  of 
fixed  length  n.  Each  component  of  the  symptom  vector  represents  a  given  sign  or 
symptom,  with  the  presence  or  absence  of  that  sign  or  symptom  indicated  by  a  1  or  0, 
respectively.  Next,  we  assume  that  no  more  than  K  classes  or  clusters  exist.  This  value 
need  not  be  exact,  but  care  should  be  taken  to  ensure  that  it  is  at  least  an  upper  bound  on 
the  total  number  of  possible  clusters  (e.g.,  K  =  the  number  of  debilitating  endemic 
diseases  +  5).  Associated  with  each  value  of  the  index  i  =  1,  2,  3,  ...,  K  is  a  prototype 
vector  Wj .  Initially,  Wj  =  1  =  (1,1,....,!)  for  each  i,  that  is,  an  n-vector  of  Is.  For  vectors  a 
and  b,  the  notation  a  •  b  signifies  the  usual  vector  dot  product.  For  binary  vectors  u  and 
V,  u(AND)v  denotes  the  binary  vector  formed  by  multiplication  of  the  respective 
components  of  u  and  v.  The  number  s  is  a  small  positive  number,  for  example,  g  =  0.01. 
Finally,  p,  0  <  p  <  1,  is  a  “tuning  parameter”  that  is  set  in  advance.  As  will  be  seen  later, 
it  is  a  “vigilance”  parameter,  controlling  the  ease  with  which  new  clusters  come  into 
existence,  as  detected  by  the  algorithm.  A  generic  sign-symptom  vector  for  a  patient  is 
denoted  by  The  ART  algorithm  is  described  in  steps  0  through  5  as  follows. 

0.  If  there  are  no  more  data,  stop.  Otherwise,  present  the  next  symptom  vector 
and  classify  it  as  “unclassified.” 

1 .  “Enable”  all  clusters,  i  =  1,  2, ...,  K. 

2.  Determine  i*  such  that 

^  £,  •  w «  'l  ft*  w  1 

— - - —  =  maxs— ^ - '■ — :i  corresponds  to  an  enabled  cluster  > 

Vw,.,  •l  +  sy  [w;  •1  +  g  J 
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3.  Compute 

4*1 

If  r  <  p  then  “disable”  cluster  i*  and  go  to  Step  2.  If  there  are  no  enabled 
clusters  left,  then  leave  ^  unclassified  and  go  to  Step  0.  Otherwise,  go  to 
Step  4. 

4.  Classify  ^  into  cluster  i*,  and  adjust  Wj*  by  replacing  it  with  Wi*(AND)^. 

5.  Go  to  Step  0. 

This  algorithm  solves  the  various  problems  raised  earlier  with  more  conventional 
clustering  algorithms.  It  forms  new  clusters  as  needed  (up  to  a  maximum  of  K)  and 
produces  a  prototype  vector  that  represents  each  cluster  (i.e.,  the  w-vectors).  The 
significance  of  the  vigilance  parameter  p  is  also  clear.  As  p  becomes  closer  to  1,  higher 
correlation  with  an  existing  prototype  is  required  in  order  to  classify  a  vector  into  that 
cluster,  and  hence  it  is  more  likely  that  a  new  cluster  will  be  established.  Conversely,  as 
p  becomes  closer  to  0,  less  correlation  is  needed  to  classify  a  vector  into  an  existing 
cluster,  and  hence  new  clusters  are  less  likely  to  be  formed. 
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4.  EXPERIMENTAL  RESULTS 


This  algorithm  has  been  coded  in  Microsoft  Qbasic  (source  listing  in  Appendix  I). 
A  prototype  with  graphical  user  interface  for  demonstration  purposes  was  also  coded  in 
Microsoft  Visual  Basic  (see  Appendix  V  for  a  picture  of  that  interface).  Both  versions 
were  tested  on  simulated  symptom  vectors  with  a  high  degree  of  success.  We  created  a 
simulated  database  of  300  patient  symptom  vectors  using  Qbasic  code  shown  in 
Appendix  II.  This  code  forms  300  symptom  vectors  of  length  70  using  the  following 
method.  First,  10  core  syndrome  vectors  are  formed  by  selecting  7  symptoms  for  each 
(symptoms  do  not  overlap).  A  core  symptom  vector  is  chosen  at  random,  and  the  current 
symptom  vector  is  set  equal  to  it.  Independently,  each  symptom  not  present  is  added  with 
probability  2/63.  (Thus,  an  average  of  2  symptoms  are  added  over  and  above  the  core 
symptoms).  The  symptom  vector  is  then  output.  This  process  is  repeated  300  times. 
One  run  yielded  the  data  shown  in  Appendix  III.  Because  patients  are  randomly  selected 
to  have  one  of  the  10  syndromes,  the  expected  number  of  patients  (out  of  300) 
representing  each  syndrome  is  30.  For  the  data  in  Appendix  III,  there  were  30,  34, 29,  32, 
24,  26,  31,  37, 28  and  29  representatives  from  the  syndromes  1,2,. ..,10,  respectively. 

The  ART  algorithm  was  presented  with  these  data  and,  processing  each  patient 
sequentially,  was  able  to  correctly  identify  all  10  syndromes  and  correctly  classify  each  of 
the  300  patients  into  the  correct  syndrome.  The  result  of  running  the  ART  algorithm  is 
shown  in  Appendix  IV  (ART  assigns  different  numerical  names  to  the  syndromes  than 
those  originally  assigned).  A  visual  display  of  the  10  syndromes  detected  by  the  ART 
algorithm  is  shown  in  the  Figure  1,  where  each  syndrome  cluster  prototype  is  represented 
by  a  bar  graph  with  each  s5miptom  present  depicted  by  a  black  bar.  The  number  below 
each  prototype  is  the  number  of  patients  aggregated  into  that  cluster  by  the  ART 
algorithm.  Note  that  each  prototype  has  7  symptoms  (the  number  of  core  symptoms 
originally  specified  in  the  simulation).  These  are  exactly  the  original  core  symptom 
vectors  used  to  generate  the  input  data  to  the  ART  algorithm.  Note  that  ART  does  not 
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Fiaure  1.  Svndrome  Prototvoes  Detected  bv  the  Art  Alaorithm 


l  U  syndromes 

7  core  symptoms  per  syndrome 


provide  a  diagnosis  for  each  cluster,  only  a  prototype  symptom  vector.  It  will  be  the  job 
of  physicians  and  epidemiologists  to  deduce  from  the  symptoms  in  the  prototype  vector 
the  probable  etiology  and/or  preventive  action  to  be  taken. 
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5.  ALTERNATE  ART  ALGORITHM 

In  one  scenario  the  ART  algorithm  will  perform  in  a  way  that  may  not  be 
appropriate.  That  scenario  is  one  in  which  the  manifestations  of  the  prototype  diseases  are 
highly  variable.  Notice  that  in  Step  4  of  the  ART  algorithm,  the  prototype  vector 
representing  a  cluster  is  “masked”  by  each  new  vector  that  is  classified  into  that  cluster. 
Suppose,  for  example,  that  after  a  large  number  of  vectors  have  been  clustered,  the 
prototype  vector  for  disease  cluster  1  is  (000111000).  If  the  vectors  (000011000), 
(000001000)  are  then  presented,  in  that  order,  it  would  be  possible  for  all  of  these  to  be 
classified  into  this  same  cluster.  If  this  happens,  the  prototype  vector  would  degenerate 
into  (000001000),  which  is  no  longer  an  adequate  descriptor  of  the  cluster.  This  situation 
would  be  caused  by  high  variation  in  how  a  disease  affects  the  individuals  in  the 
population.  It  would  not  occur  if  each  syndrome  has  its  own  strong  core  of  symptoms 
(i.e.,  symptoms  that  are  always  present  for  patients  afflicted  with  that  syndrome). 
Whether  this  can  be  a  problem  with  this  ART  algorithm  will  depend  on  the  analysis  of 
some  real  and  whether  this  is  desirable  must  be  determined  by  the  user.  We  have 
developed  an  alternate  ART  algorithm  that  is  more  robust  to  the  situation  described  here. 
The  choice  of  classifying  algorithm  depends  on  the  answer  to  several  questions,  including 
these: 

1 .  Must  the  classes  be  described  by  prototypes  whose  components  are  0  or  1  only? 

2.  Should  a  particular  symptom  be  removed  from  a  syndrome  prototype  if  even  one 
patient  who  otherwise  displays  the  symptoms  of  the  syndrome  does  not  display  the 
particular  symptom  in  question  (i.e.,  should  syndromes  be  masked?)? 

3.  Should  past  information  be  dominated  by  more  recent  information?  Or  should  old 
observations  and  new  ones  be  weighted  equally? 

4.  What  should  be  the  measure  of  closeness  of  symptom  vectors? 
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For  certain  problems,  the  answer  to  question  1  is  “yes,”  and  the  ARTl  algorithm 
is  suitable.  In  that  case  the  answer  to  question  2  is  almost  forced  to  be  “yes,”  also.  If  the 
answer  to  the  first  two  questions  is  “no,”  then  a  very  simple  alternate  scheme  is  available, 
which  we  call  the  “centroid  algorithm.”  Two  parameters  in  that  algorithm  are  set  to 
reflect  the  answer  to  the  third  question:  all  patients  may  weigh  equally,  or  the  weight  laid 
on  old  observations  may  be  allowed  to  diminish.  The  effectiveness  is  relatively 
insensitive  to  the  choice  of  measure  of  closeness,  although  it  is  sensible  to  give  some 
thought  to  this  point.  Here,  we  use  the  cosine  of  the  angle  between  the  symptom  vectors. 
This  says  that  the  difference  measure  between  them  is  related  to  the  ratio  of  the  number 
of  symptoms  on  which  they  disagree  to  the  number  of  symptoms  each  presents.  Other 
measures  might  well  be  used  (e.g.,  just  the  number  of  symptoms  on  which  they  differ;  of 
course  then  the  vigilance  parameter  would  not  be  between  0  and  1). 

In  brief,  the  algorithm  is:  first,  initialize  all  prototypes  W,  =0,  then  read  in  a 
symptom  vector  s  and  find  the  i  that  maximizes,  over  all  classes  in  use,  the  cosine  of  the 

angle  between  1^,  and  s,  that  is: 


w-  •  s 


Here,  ||*  ||  is  the  usual  Euclidean  norm.  If  that  value  exceeds  the  vigilance  parameter, 
then  assign  syndrome  i  as  the  class  of  the  patient.  If  not,  then  put  a  new  class  in  use,  and 
define  its  syndrome  prototype  vector  to  be  s,  unless  all  possible  classes  have  been  used. 
If  the  classification  was  successful,  then  update  the  prototype  syndrome  vector  according 
to: 

w.  ^  a  ■  w.  +  b  ■  s 

where  a,  b  represent  the  weight  of  old  and  new  observations:  a  =  1,  b  -  1  corresponds 
to  equal  weight,  and  convex  combinations  a  +  b  =  1  correspond  to  eroding  the  weight 
of  past  observations;  the  smaller  the  value  of  a  relative  to  b,  then  the  faster  the  rate  at 
which  past  observations  are  eroded.  This  algorithm  yields  results  similar  to  those  of 
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ARTl  on  our  data,  and  does  not  mask  the  classes  as  ARTl  does.  The  components  of  the 
syndrome  will  represent  values  between  0  and  1.  Those  symptoms  that  occur  more  often 
will  have  values  close  to  1.  In  general,  the  most  frequent  symptoms  will  have  higher 
values  than  those  that  appear  less  frequently.  If  a  and  b  are  set  to  1,  then  after  we  add  all 
symptom  vectors  to  the  syndrome  and  divide  each  element  of  the  syndrome  vector  by  the 
number  of  symptom  vectors  assigned  to  it,  we  will  get  a  vector  where  each  component 
represents  a  probability  of  occurrence  of  each  particular  symptom.  Preliminary  results 
indicate  that  this  algorithm  is  also  very  effective  in  classifying  data  of  the  type  generated 
to  test  ARTl,  for  which  masking  is  appropriate,  but  it  is  also  effective  with  data  that 
present  “missing  symptoms.” 
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6.  FUTURE  WORK 


The  experimental  work  with  the  ART  algorithm  should  be  continued.  The  next 
step  regarding  the  experimentation  will  be  to  develop  procedures  or  guidelines  for  setting 
the  vigilance  parameter.  This  can  be  accomplished  with  further  analysis  and  simulation 
studies.  Another  important  activity  is  testing  the  Alternate  ART  algorithm,  in  particular, 
the  influence  of  the  values  of  the  a  and  Z)  parameters  on  the  rate  of  erosion  of  previous 
observations.  For  both  algorithms,  it  is  important  to  perform  tests  on  real  field  data. 
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Appendix  I.  QBASIC  Code  for  the  ART  Algorithm 


REM  This  is  a  simple  adaptive  resonance  algorithm. 

REM  Professor  J.  Angus  -  Mathematics  Clinic  -  Spring  Term,  1995 
REM 

REM  W  holds  the  prototype  vectors 
REM  S  holds  the  current  data  vector 
REM  VALUES  holds  the  current  "scores" 

REM  NORDER  holds  the  indices  of  the  sorted  VALUES 
REM  rho  is  the  threshold  for  determining  a  new  cluster 
REM  k  is  the  maximum  number  of  clusters 

REM  sort  takes  arrayin  and  sorts  it  into  arrayout  in  ascending  order. 
REM  norder  maintains  the  same  permutation  of  1, 2 n  as  was 
REM  determined  by  sorting  arrayin. 

REM  wdotl  computes  the  dot  product  of  the  iith  row  of  w  with  the  vector 
REM  of  all  ones 

REM  eps  is  epsilon  (value  used  to  help  break  ties) 

REM 

DECLARE  SUB  sort  (arrayin{),  arrayout(),  norderQ,  n) 

DECLARE  SUB  wdotl  (iil,  sum!,  n) 

DIM  SHARED  w(20,  70) 

DIM  s(70),  counts(20) 

DIM  norder(20),  values{20) 
k  =  20 
rho  =  .5 
eps  =  .01 

REM  open  input  data  file 

OPEN  "d:\nhrc95\symptom.txt''  FOR  INPUT  AS  #1 
REM 

REM  open  output  data  file 

OPEN  "d:\nhrc95\testout.txt"  FOR  OUTPUT  AS  #2 
REM  read  the  number  of  columns  in  the  symptom  vectors 
INPUT  #1,  ns,  ncases 
REM 

REM  initialize  the  prototype  vectors. 

FOR  i  =  1  TO  20 
counts(i)  =  0 
FOR  j  =  1  TO  ns 
w(i,  j)  =  1 
NEXT) 

NEXT! 

FOR  lease  =  1  TO  ncases 
LINE  INPUT  #1.  data$ 
actual  =  VAL(MID$(data$,  72)) 

FOR  i  =  1  TO  ns 

s(i)  =  VAL(MID$(data$,  i,  1)) 

NEXTi 
sdotl  =  0 

REM 

REM  compute  dot  product  of  data  vector  with  vector  of  ones 
REM 
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FOR  i  =  1  TO  ns 

sdot1  =  sdot1  +  s(i) 

NEXTi 

REM  initial  class  is  0="unclassified" 
iciass  =  0 
FOR  i  =  1  TO  20 
values(i)  =  0 
norder(i)  =  i 

CALLwdot1(i,  sum,  ns) 

REM 

REM  compute  the  scores 
REM 

FOR  j  =  1  TO  ns 

values(i)  =  values(i)  +  w(i,  j)  *  s(j)  /  (sum  +  eps) 

NEXT] 

NEXTi 

FOR  i  =  1  TO  20 
■PRINT  #2,  values(i); 

NEXTi 
■Print  #2,  ■■  ■■ 

REM 

REM  sort  the  scored  and  keep  track  of  the  original  indices 
CALL  sort(values(),  values(),  norder(),  k) 

■FOR  i  =  1  TO  20 
■PRINT  #2,  values(i): 

■NEXT  i 
■PRINT  #2,  ■■  ■' 

■FOR  i  =  1  TO  20 
■PRINT  #2,  norder(i); 

■NEXT  i 
■PRINT  #2,  ’■  ■■ 

REM 

REM  Start  with  the  best  score,  and  check  the  correlation  criterion. 
REM  If  a  classification  into  an  existing  class  can  be  made,  jump  out. 
REM  If  not,  decrement  k  by  1  and  try  the  next  best. 

REM 

FOR  ist  =  k  TO  1  STEP  -1 
istar  =  norder(ist) 
wdots  =  0 
FOR  i  =  1  TO  ns 

wdots  =  wdots  +  s(i)  *  w(istar,  i) 

NEXTi 

IF  wdots  /  sdot1  >=  rho  THEN 

■PRINT  #2,  istar,  wdots  /  sdot1 
iciass  =  istar 
GOTO  outofit 
END  IF 
NEXT  ist 

REM 

outofit; 

REM 

REM  Update  prototype  vector,  print  out  data  vector  and  its  class. 
REM 

FOR  i  =  1  TO  ns 
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w(iclass,  i)  =  w(iclass,  i)  *  s(i) 

NEXTi 

PRINT  #2,  l\/IID$(data$,  1,  70)  + " "  +  STR$(iclass)  + " "  +  STR$(actual) 
counts(iclass)  =  counts(iclass)  +  1 
NEXT  lease 

FOR  j  =  20  TO  1  STEP  -1 
PRINT #2,  countsO) 

NEXTj 
total  =  0 

FORj  =  1  TO20 

total  =  total  +  counts(j) 

NEXTj 

PRINT  #2,  total 
FORJ  =  1  TO20 
proto$  = "" 

IF  countsO)  >  0  THEN 
FOR  i  =  1  TO  70 

proto$  =  proto$  +  MID$(STR$(wO,  i)),  2. 1) 

NEXTi 

PRINT #2,  proto$  + ""  +  STR$(countsO)) 

END  IF 
NEXTJ 

PRINT  "Done" 

END 

SUB  sort  (arrayinO,  arrayout(),  norderO,  n) 

DIM  ra{n) 

STATIC  ra 
FOR i = 1  TO  n 

ra(i)  =  arrayin(i) 

NEXT  i 
begin: 
switch  =  0 
FOR i  =  2  TO  n 
j  =  i-1 

IF  raO)  >  ra(i)  THEN 
save  =  raO) 
raO)  =  ra(i) 
ra(i)  =  save 
lord  =  norder(j) 
norderO)  =  norder(i) 
norder(i)  =  lord 
switch  =  1 
END  IF 
NEXT  i 

IF  switch  =  1  GOTO  begin 
FOR  i  =  1  TO  n 
arrayout(i)  =  ra(i) 

NEXTi 
END  SUB 

SUB  wdot1  (ii,  sum,  n) 

DIMi 
sum  =  0 


20 


FOR  i  =  1  TO  n 

sum  =  sum  +  w(ii,  i) 

NEXTi 

END  SUB  *****«..* 

REM* 

REM  *  Adaptive  Resonance  Theory  Program  for  * 

REM  *  Dynamic  Clustering 

REM  *  CGS  Mathematics  Clinic  Spring  1995 

REM  *  Alex  Ornatsky 

REM  *  Amir  Niknejad 

REM* 

REM  ********************************************************** 
REM 

COMMON  SHARED  n 
DECLARE  FUNCTION  dot  (vx(),  vy()) 

DECLARE  FUNCTION  norm  (vx()) 

CLS 

PRINT  "Input  the  #  of  clusters: 

INPUT  k 

PRINT  "Input  the  #  of  symptoms; 

INPUT  n 

DIM  w(1  TO  k,  1  TO  n) 

DIM  c(n) 

DIM  w1(k,  n) 

DIM  x(n) 

DIM  y(n) 

DIM  en(k) 

DIM  u(k) 

DIM  h(n) 

DIM  w20(n) 

DIM  symp(n) 

DIM  w2(n) 

DIM  w3(n) 

DIM  w4(k,  n) 

PRINT 

REM  k  is  the  number  of  clusters 
u2  =  0 


PRINT 

REM  n  is  the  number  of  symptoms 
FOR i  =  1  TO  k 
FOR  j  =  1  TO  n 
w(i,  j)  =  1 
NEXT] 

NEXTi 

PRINT 

REM  w(i,j)  is  the  initial  prototype  vector 
FOR  i  =  1  TO  k 


u(i)  =  0 
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NEXTi 

PRINT 

REM  en[1  to  k]  is  enabling  indicator 

REM  uI1  to  k]  is  the  indicator  for  number  of  vectors  assigned  to  that 
REM  prototype 

REM  PRINT  "Input  vigilance  parameter ; 

REM  e-the  threshold  parameter 
vigilance  =  .5 
eps  =  .01 

OPEN  "a:\nhrc30.dat"  FOR  INPUT  AS  #1 
DO  WHILE  (NOT  EOF(1)) 

50  FORi  =  1TOn 

REM  PRINT  "Input  Symptom";  i; 


INPUT  #1,  symp(i) 

REM  INPUT  symp(i) 

NEXTi 

PRINT  "symp  vector  is:" 

FOR i  =  1  TO  n 
PRINT  symp(i); 

NEXT  i 
PRINT 

FOR  i  =  1  TO  k 
en(i)  =  1 
NEXTi 

FOR i  =  1  TO  k 
IF  en(i)  =  1  THEN 
FORj  =  1  TOn 
h(j)  =  w(i,  j) 

ca)  =  1 

NEXT] 

b  =  dot{h().  c()) 
epsnorm  =  eps  +  b 
FORj  =  1TOn 
w1(i,j)  =  w(i,j)/ epsnorm 
NEXTj 
END  IF 
NEXTi 

100  FORi  =  1  TOk 
IFen(i)  =  1  THEN 
FORj  =  1  TOn 
w20(j)  =  w1  (i,  j) 

NEXTj 

w2{i)  =  dot(w20(),  sympQ) 
END  IF 
NEXTi 
d  =  0 
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f=0 

FOR  i  =  1  TO  k 
IF  en(i)  =  1  THEN 

IFw2(i)  >=dTHEN 
d  =  w2(i) 
f=i 
END  IF 
END  IF 
NEXT! 

IF  en(f)  =  1  THEN 
FORj  =  1  TOn 
wSQ)  =  w(f.  j) 

NEXTj 

R  =  dot(w3(),  sympO) 
s  =  0 

FORj=  1  TOn 
s  =  s  +  symp(j) 

NEXTj 

IFs>0THEN 
R  =  R/s 
END  IF 
END  IF 

IF  R  <  vigilance  THEN 
en(f)  =  0 
sum1  =  0 
FOR i  =  1  TO  k 

sum1  =  sum1  +  en(i) 
NEXT! 

IFsumI  >0THEN 
GOTO  100 
ELSE  GOTO  200 
END  IF 
END  IF 

IF  R  >=  vigilance  THEN 
u(f)  =  u(f)  +  1 

done  =  1 
ELSE  done  =  0 
END  IF 

IF  done  =  1  THEN 
PRINT  "winning  index  is:";  f, 
FOR  j  =  1  TO  n 

w4(f,  j)  =  w(f.  j)  *  symp(j) 
PRINT  w4(f,j); 

NEXTj 

FOR  j  =  1  TO  n 
w(f,  j)  =  w4(f,  j) 

NEXTj 
PRINT " " 

GOTO  250 
END  IF 
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200  u2  =  u2  +  1 

PRINT  "ATTENTION  :  NO  MATCH!!!" 

PRINT  "NUMBER  OF  UNMATCHED  VECTORS:"; 

PRINT  u2 

250  PRINT  "enabling  vector  is:"; 

FOR  i  =  1  TO  k 
PRINT  en(i): 

NEXT! 

PRINT " " 

PRINT  "#  of  vectors/cluster:"; 

FOR  i  =  1  TO  k 
PRINT  u{i); 

NEXTi 

REM  PRINT  "Continue?  YES-1 ,  No-O"; 

REM  INPUT  V 
REM  IFv  =  0THEN 
REM  GOTO  300 

REM  ELSE  GOTO  50 

REM  END  IF 
PRINT 

LINE  INPUT  "Press  Return  for  the  next  symptom  vector";  mystring$ 

LOOP 
CLOSE  #1 


300  END 

FUNCTION  dot  (x(),  y()) 
sum  =  0 

FOR  i  =  1  TO  n 

sum  =  sum  +  x(i)  *  y(i) 

NEXTi 

dot  =  sum 

END  FUNCTION 

FUNCTION  norm  (a()) 
norm  =  dot(a().  a{)) 

END  FUNCTION 
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Appendix  II.  QBASIC  Symptom  Simulator 


REM  Symptom  simulator. 

REM  CGS  Mathematics  Clinic,  Professor  John  Angus,  Spring  Term,  1995 
REM 

DIM  count(IO),  permute(70) 

REM 

REM  Initialize  the  symptom  vector. 

REM 

s$= "0000000000000000000000000000000000000000000000000000000000000000000000 
per$  =  s$ 

OPEN  "d:\nhrc95\symptom.txt"  FOR  OUTPUT  AS  1 
PRINT  #1,  70,  300 
FORi  =  1  TO10 
count(i)  =  0 
NEXT! 

RANDOMIZE 

REM 

REM  Generate  a  random  permutation  of  the  integers  1 ,  2, ...,  70 
REM 

FORj  =  1  TO  70 
permuteG)  =  j 
NEXTj 

FOR  j  =  70  TO  2  STEP  -1 
i  =  INT(RND*j)  +  1 
p  =  permuteG) 
permuteG)  =  permute(i) 
permute(i)  =  p 
NEXTj 

FORJ=  1  TO  70 
PRINT  permuteG); 

NEXTj 

REM 

REM  Begin  loop  to  create  300  patients. 

REM 

FOR  i  =  1  TO  300 
REM 

REM  Select  a  syndrome  at  random. 

REM 

id  =  INT(10*RND)+  1 

REM 

count(id)  =  count(id)  +  1 

REM 

REM  Insert  the  core  symptoms  into  the  current  symptom  vector. 

REM 

FOR  j  =  1  TO  70 

MID$(s$,  j,  1)  =  "0" 

IF  j  <=  7  *  id  AND  j  >=  7  *  Gd  -  1)  +  1  THEN  MID$(s$,  j,  1)  =  "1" 

REM 

REM  Randomly  add  an  average  of  2  additional  symptoms  to  the  symptom  vector. 

REM 

u  =  RND 

IF  u  <=  2  /  63  AND  MID${s$,  j,  1)  =  "0"  THEN  MID$(s$,  j,  1)  =  "1" 
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REM 

NEXTj 

REM 

REM  Apply  the  same  random  permutation  to  each  symptom  vector. 
REM 

FOR  j  =  1  TO  70 

t$  =  MID$(s$,  permute(j),  1) 

MID${per$,j,  1)  =  t$ 

NEXTj 

REM 

REM  Print  the  symptom  vector. 

REM 

PRINT  #1 ,  per$  +  " "  +  STR$(id) 

NEXTi 

REM 

REM  Print  out  the  number  of  patients  in  each  syndrome  cluster. 
REM 

FORj  =  1  TO  10 

PRINT  #1,  count(j) 

NEXTj 

END 
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Appendix  III.  300  Simulated  Symptom  Vectors 

The  first  two  numbers  are  the  length  of  the  symptom  vector  (i.e.,  number  of 
symptoms)  and  the  total  number  of  patients  (symptom  vectors)  generated.  Following 
those  are  the  300  symptom  vectors.  In  each  vector,  a  0  signifies  that  the  symptom  is  not 
present,  and  a  1  signifies  that  the  symptom  is  present.  The  last  number  in  a  row  is  the 
numerical  name  for  the  syndrome  cluster  to  which  the  patient  belongs.  Finally,  the  set  of 
numbers  at  the  end  of  the  data  set  are  the  total  number  of  patients  belonging  to  each 
syndrome  cluster. 

70  300 

000000000000000000000000000001 1 1 0000000000000001 0000001 1 0001 00010001 1 0  4 
000000010000000010001 000000001 1 00000000000001 001 0000001 1 00000001 00001 0  4 
0001 1 00000100000000000000000000001 00001 0000000000001 01 000000000001 0000  6 
00000000000001 1 00000000000000001 000000000000001 000001 000001 00010000000  8 
000001 0001001 001 001 0000000000000001 01 000000001 001 000000000000000001 000  9 
010000000000000010000000001 01 00001 000001 1 00000000000000000010000100000  1 
000000000000000001 0000101 00000000000000001 001 000000000001 00001 00000001  1 0 
000001 0000001 001 1 01 0000000000001 00101000000001 000000000000000000000000  9 
000000000000000001 0000101 000000001 00000001 001 000000000001 000001 0000001  1 0 
00001001000000000010000001 000000000000000001 000001000000010000000001 00  7 
000000000000000001 0010101 0000000000000000 1001 000000000001 000000000000 1  10 
0000001 00100000000001 1 010000000000000000000000001 00000000001 0000001 000  5 
00000000001 0000001 00001 01 000000000001 00001 0010000000000010000000000001  1 0 
1 000000000000000000000000001 0000000101 00001000000000000000001 1 00000000  3 
00000000000000000100001 01 00000000000000001 001000000000001 0000001 000001  1 0 
00001 001 000000000000010001 000100000000000001000001 00000001 0000000001 00  7 
0000001 00100000000001 1 000000000000001 000000000001 00000000001 0000001 000  5 
00100000100100000001 0001000000001 000001 1 00000000001 0000000000001 000000  2 
0001 0000001 00000000000000000000001 00001 000000000000101000000000001 0000  6 
0000000100000000010001 1 01 00000000000000001 001 000000000001 0000000000001  1 0 
0000010100001001 001 000000000000000101 000000001 000000000000000000000000  9 
01 000000000000001 0000000001 01 001 00000001 1 00000000000000000000000100000  1 
000000000000000000000000001 001 1 0000001 0000000001 0000001 1 00000001 000010  4 
1 00000000000001 0000000000001 00000001 01 00001000000000000000001 1 00000001  3 
1 0000000000000000000000000010000000101 00001000000000000000001 1 00000000  3 
0001 0000001 00000000000000000000001 1 0001000000000000101 000000000001 0000  6 
1 000000000000000000000000001 0000000101 00001000000000000000101 1 00000000  3 
0000001001 00000000001 1 0000000000001001 0000000000100000000001 0000001 000  5 
000000000000000100000000000001 1 00000000000000001 0000001 1 001 0000100001 0  4 
0001 0000001 00100000000000000000001 00001000000000000101 000000000001 0000  6 
0001 001 001 00000000001 1 000001 000000001 00000000000100000000001 0000001001  5 
00000000000001 1 00000000000000001 000000000000001 000001 0000010001 0000000  8 
0001 0000001 00000000000000001 001 001 00001000000000000101000000000001 0000  6 
000000100100000000001 1 0000000000000001 0000000000100000000001 0000001000  5 
0000101 1 000000000000000001 00000000001 000001 1 00000100000001 0000000001 00  7 
00000001000001 1 0000000000000001 1 000000000000001 000001 0000010001 0000000  8 
00001 001000000010000101001 000000000000000001 000001 000000010000000001 00  7 
01000000000000001 0000000001 0100000000001 1 000000000000000000000101 00000  1 
0100000000000000100000000010100000000011100000000000011000000000100000  1 
00000000000001 1 00000000000000001 000000000000001 000001 000001 01 01 0000000  8 
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000000100100000000001 1 000000001000000000000000001 00001 000001 0000001 000  5 
1 0000001 1 0000000000000000001 0000000101 1 0001001 000000000000001 1 00000000  3 
0000000000000000010000101 1 0000000000100001 001 0000000000010000000000001  1 0 
0001000000100000000000000000000001 00001000000000000101 000000000001 0000  6 
0000010000001001001 001 0000000001 00101000000001 000000000000000000000010  9 
100101 0000100000000000000000000001 000010000000000001 01 001000000001 0000  6 
0001000000100000000000000000000001 000010000000000101 01 000000000001 0000  6 
0001000000100000000000000000000001 00001 1 00000000000101 000000000001 0000  6 
00000000000001 1 00000000000000001000000000000001 1 0001 1 000001 00010000000  8 
00 1 00001 1 001 00000001 000 1 000000001 0000000000000000010000000000000000000  2 
01 0000001 000000010000000001 0 1 00000000001 1 0000001 00001 000000000001 00000  1 
0000010000001 001001 00000000000000010100000000100000001 000001 00011 00000  9 
001000001 001 00000001 000100000000100001 001 0000000001000000001 0000000000  2 
00000001000001 1 00000000000000001 01 0000000000001000001 00000100010000000  8 
000100000010000000000000001 0000001 00001 0000000000001 01 00000001 0001 0000  6 
00010010001 00000000000000001 00000100001 0000000000001 01 0000000000010101  6 
1 000000000000000000000000001 0000000101 00001000000000000000001 1 00000000  3 
000100000010000000000001 1 00000001 1 00001 0000000000001 01 000000000001 0000  6 
0000001 00100000000001 1 00000000001 000000000000000100000000001 0000001 000  5 
1 1 0000000001 0000 1 0000000001 1 1 00000000001 1 00000000000000000000000100000  1 
000001 0000001 001001 000001 1 00000000101000000001 00000000000000000001 0000  9 
001 1 0000100100000001 0001000000001 0000000000000000010000001000000000000  2 
1 000000000000000000000000001 0000000 1 01 00001000000000000000001 1 00000000  3 
1 000000000000000100000000001 0000000101 00001000000000000000001 1 00000000  3 
000000000000000100000000001001 1 00000000000000001 001001 1 1 000000010001 1 0  4 
000000000000000000000000000001100000000000000001 0000001 1 00100001000010  4 
000001 0000001 001 001000000000000000101000000001000000000000000000000000  9 
000000000000000001 0000101 00000000000000001001 000000000001 0000000000001  1 0 
0001001 00100000000001 1 0000000000000000000000000010000000001 1 0000001 000  5 
00001 001 000000000000000001 00000000000000000100000100000001 0000000001 00  7 
00000001 0000000000000000000001 1 00000000000000001 1 000001 1 00000001001010  4 
0000000000000000010000101 000001 00000000001001 0000000000010000000000001  1 0 
00001001 000000000000000001 00000000000000000100000 1 00000001 0000000001 00  71 
00000000000001 1 000000000001 1 0001 000000000000001 000001 000001000 1 0000000  8 
01 00000000000000100000000010100000000001 1 000000000000000000000001 00000  1 
001 000001 001 00000001 0001 000000001 0000000000000000010000000000000000000  2 
000000000000000000000000000001 1 00000000000001 0010000001 1 00000001000010  4 
0000000000000110000000000000000100000000000100100000100000100010000000  8 
000001 0000001 001 001 000000000000000101000000001000000000000000000000000  9 
1 000000000000000000000000001 0000000101 0001 1 000000000000000001 1 00000000  3 
01 00000010000000100000000010100000000001 10100001 0000000000000000100000  1 
0000001 001 1 0000000001 1 00000000000000000000000001 1 00000000001 0000001 000  5 
000001 0000001 001001 000000000000010101000000001 000000001000000000000000  9 
000000000000001 001 0000101 00000001 000000001 001 0000000000010000000000001  1 0 
0100000010000000100000000010101 1 000001 01 1 00000000000000000000000100000  1 
000001 1 000001 001001 000000000000000101000000001 000000000000000000000000  9 
000001 1 10100000000001 1 00000000000000000000000000100000000001 0000001 000  5 
0001 0000001 00000000000000000000001 000010010000000001 01 001000001 001 0000  6 
00000000000001 101000000000000001 000000000000001 000101 000001001 1 0000000  8 
000000000000000000000010000001 1 00000000001000001 1 1 00001 1 0000000100001 1  4 
0001 1 001 000000000000000001 000000000000000001000001000000010000000001 00  7 
00000000000001 1 00000000000000001 000000000000001 000001 000001 00010000000  8 
0100000000000000100000000010100000000001 1 000000000000000001 000001 00000  1 
000001 0000001 001001 000000000000000101 1 00000001 000000000001 000000000000  9 
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000000000000000001000010100000000010000001 001 010100000001000000001 1001  10 
1 000000000000000000100000001 0000001 1 01 00001 000000000000000001 1 00000000  3 
0100101001 00000000001 1 001 000000000000000000000001 00000000001 0000001 000  5 
0 1 00000000000000 1 000000000101 00000000001 1 1 00000000000000000000001 00000  1 
00000000000001 1 000001 00000000001 000000000000001 000001000001 00010000000  8 
1000000000000000000000000001000001010110001000000000000000001100000000  3 
001000000000000000000000000001100000000000000001 0000001 1 0000000100001 0  4 
001 000001 00101 000001 000 1 000000001 00000000000000001 1 00000000000000000 1 0  2 
01 00000000000000100000000010100000000001 1 000000000000000000000001 00000  1 
01000000000000001000000000101 00000000001 1 00000001 000000000000000100000  1 
00000000000000000100001 01 000000000000000010010000001 00001 0000000000101  1 0 
001000001 001 1 0000001 0001000000001010000000000000001 1 000000001 000000000  2 
000001 0000001 001 001 1 00000001 0000001 01 001 000001 0000000000000000001 00000  9 
0000000001 00000000000000000001 1 00000000000000001 0000001 1 00000001 00001 0  4 
000000100 1 00000000001 1 00000 1 00000000000000000000 1 000000000010000001 1 00  5 
001000001 0010000001 1 0001 000000001 01 0000000000000001 0000000000000000000  2 
000000000000000000000000000001 1 00000000000000001 0000001 1 00000001 00001 0  4 
0000000000000001 00000000100001100000000000000001 000000110000000100001 1  4 
0000100100000001 0000000001 00000000000000000101 000100000001 000000000100  7 
0000000000000000010000101 00001 000000000001 001 1 000000000010000000000001  1 0 
00000000000001 1 00000000000000001000000000000001 1 00001 000001 000 1 0000000  8 
0000000000000000000000101 00001 1 00000000000000001 0000001 1 00000001 00001 0  4 
00000000000001 0000000000000001 1 00000101 000000001 0000001 1 00000001 00001 0  4 
0001000001 1 00000000000100000000001 00101 01 0000000000101 000000000001 0000  6 
000000000000000000001 000000001 1 000000000000000010000001 1 0000000100001 0  4 
00000000000001 1 00000000000000001 000000000000001 000001 000001 000 1 0000000  8 
0100001 0000000001 0000000101 01 00000000001 1 000001 000000000000001 00100000  1 
000000000000000001 00001 01 00000000000000101001 1 000000001 010000000000001  1 0 
0000001 001 00000000001 1 000000000000000000000000001 00001 000001 0000001 000  5 
00001 001000000000000000001 000000000001 00000100000100000001 00000001 01 00  7 
001001 0000001 001 001 000000000000000101000000001 000000000000000000000000  9 
000001 0000001 001 001 000000000000001 1 01000000001 000000000000000000000000  9 
000001 0000001 011001 0000000000001 00101000000001 0000001 00000000000000000  9 
00010001001 00000000000000000000001 00001000000000000101 000000000001 0000  6 
001000001 00100000001 00010001 00001 000000000000000001 0000000000000000000  2 
1 000000000000000000000010001 1 0000001 01 01 001000000000000000001 1 0001 0000  3 
0010001 01001001 00001 0001 000000001 000000000000000001 0000000000000000000  2 
000000000000000001 0000101 00000000000000001 01 1 0000000000010000000000001  1 0 
001 00000 1 001 00000001 0001000000001 000000000000000001 0000000000000000000  2 
01 000000000000001 0000000001 01 00000000001 1 000000000000000000000001 00000  1 
1 0000000000000000000000000010001 100101 00001 000000000000000001 1 00000000  3 
00001 001000000000000000001 000000000000000001 000001 00000001 0000000001 00  7 
01 0000000100000001 00001 01 00000000000000001 01 1 0000000000010000000000001  1 0 
0010001 01001 00000001 001 1 000000001 0000000001000001 01 00000001 0000001 0000  2 
1100000000000000000000000001000000011100001000000000000000001100100000  3 
000000000000000000000000000001 1 01 0000000000000010000001 100000001 00001 0  4 
001000001 001 1 0000001 0001 000000001000000000000000001 000000001 000000001 0  2 
001000001 001 000100010001000000001 000000000000000001 0000000000000000000  2 
00001 000000000000100001 01 00000000000000001001 000000000001 0001 000000001  1 0 
0000101 00100000000001 1 00000001 000000000000000000100000100001 0000001 000  5 
1 0000000010000001 0000000000 1 0000000 1 01 0000 1 00000000000000000 1 1 00000000  3 
0000001 1 01 00001 0101011 000000000000000000000000001000001 00001 001001 1 000  5 
1 000001001 00100000001 1 1 0000000000000000000000000110000000001 0000001 000  5 
001000001 001 00000001 0001000000001 000000000000000001 0000000000000000000  2 
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001000001 001 00000001 0001001010001 1 000000000000000010000000000000000000  2 
1 000000000000000101 000000001 000000010101 001 000000000000000001 1 00000000  3 
0000001001 00000000001 1 000000000000000000000000001 000000100010000001 000  5 
01 001001 00000000000000001 1 00000101 0000000001000001 0000000101 00100001 00  7 
00000000000001 1 00000000000000001000000001 000001000001 1 00001 00010000000  8 
00100000100100000001 0001000000001 00000000000001 00010000000000000000000  2 
00001001 00000000000000000100000000000000000100000100000001001 0000001 00  7 
00000000000001 1 00000000000000001 000000000000001 000001 0000010001001 0000  8 
1 0000001 000000000001 000000010000000101 00001000000000000100001 1 00000010  3 
000101 000010000000000100001 0001 001 000010000000000001 01 1 00000000001 0000  6 
00100000100100000001 0001000000001 00000000000000000100000001 00000000000  2 
00001001 000000000000000001 000000000000000001000001 000000010000000001 00  7 
000000000001 000001000010100000001000000001 0010000000000010000000001001  1 0 
00000000001000001 1 0000101 00010101 000000101001 000000000001 1 000000000001  1 0 
00001 001 000000000000000001 00000000000000000100000100000001 0000000001 00  7 
01 00000000000000100000000010100000010001 1 0000000000000000000001 1 1 001 00  1 
000000000000000001 000010100000000000000001001 000001 0000010000000000001  1 0 
00001 001 000000000000000001 0000001 0000000000100000100000001 0000000001 1 0  7 
1000000000000000000000000001 0000000101 0000100000000000000001 1 1 00000000  3 
000000000000000000000000000001 1 1 00000000000000010000001 1 00000001000010  4 
01000000000000001 000000000101001 00000001 1 00001 0000000000000000001 00000  1 
00001 001 000000000000001001 000000000001 00000100000100000001 0000000001 00  7 
0000000000000000010000101 01 001 000000000001001 000000000001000000000001 1  1 0 
0010000010010000001 1 0001 000000001 0000000000000000010001 000100001000000  2 
0000001 00100000000001 1 00000000000000000100000000100000000001 0000001 000  5 
000000000000000001 000010100000000001000001001 0000000000010000000000001  1 0 
00100001 1 1 01 00000001 0001000000001 000001 000001 000001 00000000000000101 00  2 
1 00000000000000000000000000101000001010000101 0010010000000001 1 00000000  3 
00100000100100000001 0001 00000000100000000000000001 1 0000000000000000000  2 
00000000000001 1 000000000001 00001 000000000000001000001 000001 00010000000  8 
0000000000001 1 1 00000000000000001000000000000001 000001 1 00001 00010000000  8 
0000010000000000000000000000011100000000000000010000001100000001000010  4 
0000001 00100000000001 1 000000001 000000000000000001 1 1 0000000010000001 000  5 
0000000000000001 00000000000001100000000000000001 0000001 1 00000001000010  4 
000000000000000101 0000101 00000001 000000001 001 00000000001 1 0000000000001  1 0 
000000000000000001 00001 01 1 0000000000000001001 000000000001 1 000000000001  1 0 
000000000000000000000000000001 1 00000000000000001 0000001 1 00000001000010  4 
0000000000001 1 1 00000000000000001 000000000000001 101001 00000100010000000  8 
000000000000000001 0000101 00000000001 000001001 00000001 00010000000000001  1 0 
001000001001 00000001 00010000001 01 000000000000000001 0000000000000000000  2 
1 000000000000000000000000001 00000001 01 00001000000000000000001 1 00000000  3 
00000001 0000000000000000000001 1 00000000000000001 0000001 1 00000001000010  4 
01000000000100001000000000101 1 01 00000001 1 000000000000000000000001 00010  1 
1 000000000000000000000000001 0000000101 0000100000001 0000000001 1 00000000  3 
1 000000000000001 0000000001 01 0000000101 00001000000000000000001 1 1 0000000  3 
000001 1 000001 0010011 00000000000000101 1 0000000100000001 0000000000000000  9 
00100000100101000001 1 001 000000001 000000000000010011 00000000001 00000000  2 
00001001 00001 0000000000001 00000000000000000100000100000001 0000000001 00  7 
00001 001 000000000000000001 0000000000000000010000010001 0001 0000001 001 00  7 
00000000000000000001 0000000001 1 000000000000000010000001 1 000001 01000010  4 
0010000000000010000000001 000011 000000000000001010000001 1 00001 001 000010  4 
1 0000000000001 1 00000000000000001 000000000000001 000001 00001 1 01010000000  8 
1 0001001 000000000000000001 000000000000000001 00001 1 00000001 1 000000001 00  7 
0001000000101 000000000000000000001 000010000000000001 01 000000000001 0000  6 
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0001 0000001 001 00100000000000000001 00001 0000000000001 01 1 00000000001 0000  6 
000001 0000001 001001001 000000000101 1 01000000001 000000001 000000000000000  9 
000000000000000000000000000001 1 000000001 000000010000001 1 00000001000010  4 
0 1 00000000001 000 1 0000000001 1 1 00000000001 1 000000000000000000000001 00000  1 
01000000001 000001000001 00010100000001001 1 00000000000000000000000100000  1 
001000001 001 00000001 0001 000000001000000000000000001 0100001 000000000000  2 
001 001 0000001 001 001 0000000000000001 01000000001 000000000000000000001010  9 
000000000 100011 00000000000000001 000000000000001 000001 000001 00010000000  8 
1 00000000000000000000000000100010001 01 000011 001 00000000000001 1 00000001  3 
001000001 00100000001 0001 101 000001 1 00000000000000001 0000000000000100000  2 
01 0000000000000010000000001 1 1 0001 001 0001 1 00000000000000000000000100000  1 
0100000000000000100000000010100100000001 1 00000000000000000000000100000  1 
001000001001 00000001 0001000000001 000000000000000001 0000000000000000000  2 
1 00000000000000000000000000100000001 01 00001 000000000000000001 1 00000000  3 
1 01000000000000000000000001 1 00010001 01 00001 000000000000000001 1 00000000  3 
00001 001000000000000000001 000010000000000001 001 001 000000010000000001 00  7 
0 1 000000000000001 101 0000001 0 1 00000000001 1 000000000000000000000001 00000  1 
1 01 000001 00100000001 00010000001 01 0000000000000000010000000000000000000  2 
00000000000001 1 00000000000000001 001 000000010001 000001 000001 00010000000  8 
00001 001000000000000000101 00000000000000001 1 00001 1 00000001 0000000001 00  7 
.  00000000000001 1 00000000000000001 000000000000001 000001 00100100010000000  8 
00000000000001 1 00000000000000001000000000000001 000001 000001 00010000001  8 
00000000000001 1 00000000000000001 0000001 00000001 000001 100011 0001 0000000  8 
001 00000000000000100001 1 1 0000000000000000 1 001 0000001 00001 0000000000001  1 0 
001000001 00100000001 0001 01 0000001 0000000000000000010000000000000000000  2 
000000001 000000001 0000101 00000000000000001 001 000000000001 0000000000001  1 0 
1 000000000000000000000000001 00000001 010000101 0000000000000001 1 00000000  3 
000000000000000000000000000001 1 00000000000000001 000000 1 1 00000001 00001 0  4 
0001 0000001 00000000000000001 000001 00001 00000001 00001 01 000000000001 0000  6 
0000000000000001000000000000011001000000001000011000001100000001000010  4 
000000000000000000010000000001 1 00000000000000001 0000001 1 00000101 00001 0  4 
00001 001000000000000000001 000000000000000001000001 00000001 0000000001 00  7 
00000000000001 1 00000000000000001 0000000001 00001 01 00010001 01 0001001 0000  8 
0101 0000001 00000000000000000000001 00001 00001 0000000101 000000000001 0000  6 
00001 001000000000000000001 000000000000000001 000101 00000001 0000000001 00  7 
00000000000001 1 00000000000000001 000000000000001 000001 000001 001 1 0000000  8 
000001 0000001001 001 0000001 000000001 01 000000001 000000000000000000000000  9 
000001 000001 1 001 0010000000000000001 01000000001000000000000000001 000000  9 
000001 0000001 101001 000000000000000101 00000000101 000000000001 0000000000  9 
1 000000000000000000000000001 0000000101 00001000000000000000001 1 00000000  3 
0000010000001 001001 0000000000000001 01 00000000101 0100000000000000000000  9 
0001 000000100000000001 1 00000000001 00001 000000000000101 0000000000010000  6 
001000001001 00000001 0001000000001000000000000000001 0000000000001000000  2 
000001 0000001001001 000001 000000000101 00001 0001000000000000000001 000000  9 
001000001 001 00000001 0001 000000001000000000000000001 000001 1 000000000000  2 
0010000010010000000100010000000010000000001000000010000000000000000000  2 
000001 0000001 001011 0000000000001001 0101 000000100000001 001 1 000000000000  9 
00001001000000000000000001 000000000000001 00100000100000001 00000001 1 1 00  7 
00101 001 000000000000000001000000000000000001 000001 000000010000000001 00  7 
0000100100000000000000000100000000000000000100000100000001000000000100  7 
00001 001 000000000000000001000000000000000001 000001 00000001 0000000001 00  7 
0001 0000001 0000000000000000000000100001000000000000101 00001 0000001 0000  6 
0000001 001 00000000001 1 000000000000000000000000001000001 00001 0000001 000  5 
000001 0000001 001001 0000000000000001 01 001 000001 0000001 00000100000000000  9 
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00001001 00000000000000000100000000000000000100010100001 0010000000001 00  7 
000000000000000001 000010100000000000000001 0010000000000010000000000001  1 0 
0001000000100000000000000000000001 00001 0000000000101 01 00000000001 1 0000  6 
1 000010000001 001001 00000000000001 01 1 1 000000001 000000000000000000000001  9 
00000000000001 1 00000000000000001 000000000000001 00001 1 00000100010000001  8 
01 000001 000000001000000000101 00000000001 1 00000100000000000101 000100000  1 
0000000000001 00000000000000001 1 00000000000000001 0000001 1 0000000 1 0000 1 0  4 
000001 0000001 001001 000000000000000101000000001 000000000000000000000000  9 
00001 001000000001000000001000000001 0000001010000010001 0001000000000100  7 
00000000000001 1 000000000001 01 001 000001000000001 000001 1 00001 00010001 000  8 
0000001 001 00000000001 1 000000000000000000000000001 00000000001 0000001 000  5 
00000000000000001 0000000000001 1 00000000000000001 0000001 1 00000001000010  4 
000001 110101 000000001 1 0000001 00000000000000000001 0001 000001 1 0000001 000  5 
00000000000000000000000001 0001 1 00000000001 000001 0000001 1 0000000100001 0  4 
00000000000001 1 00000000000000001 000000000000001000001 00000100010000000  8 
0000000000000000010001 1 1 1 00000000000000001 0010000000001010000000000001  1 0 
00001001 0000000000000010010000001 01 00001 001 1 000001 00000001 1 000000001 00  7 
001000001 00100000001 0001 000000001 1 0000000000000000100000000000000001 00  2 
01 000000000000001 00000000010100000000001 1 000000000000001 00000000100000  1 
00010000000001 1 00000000000000001 000000000000001000001 000001 0001 001 0000  8 
000000100100000000001 1 00000000000000000000000000100000000001 0000001 000  5 
01 000000000000001 0000010001 01 00000000001 1 000000001 000000000000001 00000  1 
0010000010010000000100010000000010000100000100000010000000000000000000  2 
001000001001 00000001 0001000000001000000000100000001 000000000000001 0000  2 
00000000000001 1 00000000000000001 0000000001 00001 000001 01 00010001 1 000000  8 
00010000001001 000001 00000000000001 00001000000000000101 1 00000000001 0000  6 
000000100100000000001 1 001000000000000000000000001 1 0001 000001 0010001 000  5 
00001 000000001 1 00000000100000001 000000000000011001001 00000100010000000  8 
01 0000000000000010000001001 01 001 00000001 1 000000010000000000000001 00000  1 
00001 001 000000000000000001 0000000000000000010000010000000100000001 01 00  7 
000000000000000000000000000001 1 00001 01 0000000001 0000001 1 00000001000010  4 
01 00000000000000100000000010100000000001 1 00000000001 0000000000001 00000  1 
0001000000100101000000000000000001 00001 0000000000001 01 000000000001 0000  6 
1 0000000000001 1 00000000000000001 001 000000000001 000001 000001 00010000000  8 
01 000000000000001000010000101 00000000001 1 000000000000000000000001 1 0000  1 
000100000010000000001 00000000001 01 000010000001 00000101 000000000001 0000  6 
0100000000001 000100000000010100000000001 1 00000000000000000000000100000  1 
1 0000000000001 1 00000000000000001 0000001 00000001000001 000101 0001 0000000  8 
01000000000001 1 00000000000000001 000000000000001 000001 00000100010000000  8 
0000010000001001001 000100000000000101 1 00000001 000000000000000000000000  9 
00000000000001 1 0000000000000001 1 000000000000001 000001 1 0000100010000000  8 
00000000000001 1 00100000000000001 000000000000001 000001 000001 000 1 0000000  8 
1 000000000000000000000000001 0000001 1 01 00001000000000000000001 1 00000000  3 
1 000000000000000000000000001 00000001 01 00001000000000000000001 1 00000000  3 

30 
34 
29 
32 
24 
26 

31 
37 
28 
29 


32 


Appendix  IV.  Application  of  the  ART  Algorithm  to  Simulated 
Symptom  Data 

This  represents  the  output  from  the  ART  algorithm  in  Appendix  I  when  presented 
with  the  data  in  Appendix  III.  The  two  numbers  at  the  end  of  each  symptom  vector  line 
are  the  numerical  cluster  name  assigned  by  the  algorithm  and  the  original  cluster  name. 
Note  that  the  two  names  do  not  coincide  because  the  data  are  random  with  respect  to  the 
underlying  clusters.  That  is,  the  ART  algorithm  recognizes  clusters  in  the  order  in  which 
it  detects  them,  and  then  assigns  a  name  beginning  with  the  maximum  number  of  clusters 
(20  in  this  run)  and  decrements  the  name  by  1  for  each  new  cluster  detected.  For 
example,  original  syndrome  number  4  is  the  first  cluster  detected,  so  it  is  assigned 
number  20  by  the  ART  algorithm.  Next,  original  syndrome  number  6  is  detected,  and  it 
is  assigned  number  19,  and  so  on.  At  the  end  of  the  data  set  is  a  tabulation  of  the  number 
of  patients  detected  in  each  of  10  clusters  (followed  by  ten  Os,  indicating  that  the 
algorithm  did  not  establish  any  of  the  up  to  10  additional  clusters  it  could  have),  followed 
by  the  sum  of  these  numbers  (300,  indicating  that  every  patient  was  found  in  a  cluster, 
that  is,  there  were  no  "unclassified"  patients).  Finally,  the  last  set  of  records  at  the  end  of 
the  data  set  are  the  syndrome  prototype  vectors  along  with  the  corresponding  number  of 
patients  clustered  with  each.  A  comparison  of  the  data  in  this  appendix  with  that  in 
Appendix  III  shows  that  the  ART  algorithm  identified  all  10  syndromes  correctly,  and  it 
correctly  clustered  all  the  patients  correctly  without  error. 

000000000000000000000000000001 1 1 00000000000000010000001 1 00010001 0001 1 0  20  4 
0000000100000000100010000000011000000000000010010000001100000001000010  20  4 

0001 1 000001 00000000000000000000001 00001 0000000000001 01 000000000001 0000  1 9  6 
00000000000001 1 00000000000000001000000000000001 000001 0000010001 0000000  1 8  8 
000001 0001 001 001 0010000000000000001 01 000000001001000000000000000001000  1 7  9 
01000000000000001 00000000010100001 000001 1 0000000000000000001 00001 00000  16  1 
0000000000000000010000101 00000000000000001 001000000000001 00001 00000001  1510 
000001 0000001 001 1 01 000000000000100101 000000001 000000000000000000000000  1 7  9 
00000000000000000 1 0000101 000000001 00000001 00 1 000000000001 000001 0000001  1510 
00001 00100000000001 0000001 000000000000000001 00000100000001 0000000001 00  14  7 
000000000000000001001 010100000000000000001001 000000000001 0000000000001  1510 
000000100100000000001 1010000000000000000000000001000000000010000001000  13  5 
00000000001 000000 1 0000101 00000000000100001 001000000000001 000000000000 11510 

1000000000000000000000000001000000010100001000000000000000001100000000  12  3 

00000000000000000100001 01 00000000000000001 001000000000001 0000001000001  1510 

0000100100000000000001000100010000000000000100000100000001000000000100  14  7 

0000001 001 00000000001 1 00000000000000100000000000100000000001 0000001 000  13  5 
001 00000100100000001 0001 000000001 000001 1 00000000001 0000000000001 000000  1 1  2 
0001 0000001 00000000000000000000001 00001 0000000000001 01 0000000000010000  1 9  6 
0000000 1 0000000001000 1101 00000000000000001001 000000000001 000000000000 11510 
000001 01 00001001 001 000000000000000101 000000001000000000000000000000000  17  9 
01000000000000001 000000000101 00100000001 1 00000000000000000000000100000  1 6  1 
000000000000000000000000001 001 1 0000001 0000000001 0000001 1 00000001 00001 0  20  4 
1 00000000000001 0000000000001 00000001 01 0000 1 00000000000000000 1 1 0000000 1123 
1 0000000000000000000000000010000000101 00001 000000000000000001 1 00000000  12  3 
0001000000100000000000000000000001 100010000000000001010000000000010000  19  6 
1000000000000000000000000001000000010100001000000000000000101100000000  12  3 

0000001 001 00000000001 1 0000000000001001 0000000000100000000001 0000001000  1 3  5 
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000000000000000100000000000001 1 00000000000000001 0000001 1 0010000100001 0  20  4 
0001 0000001001 00000000000000000001 00001 000000000000101000000000001 0000  1 9  6 
0001 001001 00000000001 1 000001000000001000000000001 000000000010000001001  1 3  5 
00000000000001 1 00000000000000001000000000000001 000001 00000100010000000  1 8  8 
0001 000000100000000000000001001 00100001 0000000000001 01 000000000001 0000  1 9  6 
0000001001 00000000001 1 00000000000000010000000000100000000001 0000001000  1 3  5 
00001011 000000000000000001 0000000000 1 00000 1 1 00000 1 00000001 000000000 100  14  7 
00000001 000001 1 0000000000000001 1 000000000000001000001 00000100010000000  1 8  8 
00001 001 00000001 0000101 001 000000000000000001 00000100000001000000000100  1 4  7 
01000000000000001 0000000001 01 00000000001 1 000000000000000000000101 00000  1 6  1 
01000000000000001 0000000001 01 0000000001 1 1 0000000000001 1 0000000001 00000  1 6  1 
00000000000001 1 00000000000000001000000000000001 000001 00000101010000000  1 8  8 
000000100100000000001 1 000000001000000000000000001 00001000001 0000001000  1 3  5 
1 0000001 1 00000000000000000010000000101 1 0001 001000000000000001 1 00000000  1 2  3 
000000000000000001 0000101 1 0000000000100001 001000000000001 0000000000001  1510 
0001 000000100000000000000000000001 00001 0000000000001 01 000000000001 0000  1 9  6 
0000010000001001001001 00000000010010100000000100000000000000000000001 0179 
1 001010000100000000000000000000001 00001000000000000101001 000000001 0000  1 9  6 
0001 000000100000000000000000000001 00001 00000000001 01 01 000000000001 0000  1 9  6 
0001 000000100000000000000000000001 00001 1 00000000000101000000000001 0000  1 9  6 
00000000000001 1 00000000000000001000000000000001 1 0001 1 00000100010000000  1 8  8 
001 00001 1 001 000000010001000000001000000000000000001 0000000000000000000  1 1  2 
010000001 0000000100000000010100000000001 1 0000001 00001 000000000001 00000  1 6  1 
0000010000001 0010010000000000000001 01 00000000100000001000001 0001 1 00000  1 7  9 
001000001 001 00000001000100000000100001 0010000000001 0000000010000000000  1 1  2 
0000000100000110000000000000000101000000000000100000100000100010000000  18  8 
0001 00000010000000000000001 0000001 00001 00000000000010100000001 0001 0000  1 9  6 
0001 001000100000000000000001000001 00001 0000000000001010000000000010101  1 9  6 
1 0000000000000000000000000010000000101 00001000000000000000001 1 00000000  1 2  3 
0001 00000010000000000001 1 00000001 1 00001 000000000000101 000000000001 0000  1 9  6 
0000001 00100000000001 1 000000000010000000000000001 00000000001 0000001000  1 3  5 
1 1 0000000001 00001 0000000001 1 1 00000000001 1 000000000000000000000001 00000  1 6  1 
000001 0000001 001 001000001 1 0000000010100000000100000000000000000001 0000  1 7  9 
001 1 00001 00100000001 0001 00000000 1 000000000000000001 000000 1 000000000000  1 1  2 
1 0000000000000000000000000010000000101 00001000000000000000001 1 00000000  12  3 
1000000000000000100000000001000000010100001000000000000000001100000000  12  3 
0000000000000001000000000010011000000000000000010010011100000001000110  20  4 
000000000000000000000000000001 1 00000000000000001 0000001 1 00100001 000010  20  4 
0000010000001 001 001 000000000000000101000000001 000000000000000000000000  1 7  9 
000000000000000001 0000101 00000000000000001 001000000000001 0000000000001  1510 
0001 001 00100000000001 1 0000000000000000000000000010000000001 1 0000001000  1 3  5 
00001 001 000000000000000001 00000000000000000100000100000001 0000000001 00  14  7 
000000010000000000000000000001 1 00000000000000001 1 000001 1 00000001001 01 0  20  4 
000000000000000001 0000101 000001 00000000001 001 000000000001 0000000000001  1510 
00001 001 000000000000000001 000000000000000001000001 00000001 0000000001 00  14  7 
00000000000001 1 000000000001 1 0001000000000000001000001 00000100010000000  1 8  8 
01000000000000001 00000000010100000000001 1 000000000000000000000001 00000  16  1 
001 000001 001 00000001 000 1 000000001 000000000000000001 0000000000000000000  1 1  2 
000000000000000000000000000001 1 00000000000001001 0000001 1 00000001 00001 0  20  4 
00000000000001 1 00000000000000001000000000001 001000001 00000100010000000  18  8 
0000010000001 001 001000000000000000101000000001 000000000000000000000000  17  9 
1 00000000000000000000000000 1 0000000 1 01 0001 1 000000000000000001 1 00000000  1 2  3 
010000001 00000001 000000000101 00000000001 1 01 00001 0000000000000000100000  16  1 
0000001 001 1 0000000001 1 00000000000000000000000001 1 00000000001 0000001000  1 3  5 
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000001 0000001 001 001 0000000000000101 01 000000001 000000001 000000000000000  1 7  9 
000000000000001 001 0000101 00000001 000000001 001 000000000001 0000000000001  15  10 
0 1 000000 1 00000001 0000000001 01011000001011 000000000000000000000001 00000  1 6  1 
000001 1 000001 001 001 0000000000000001 01000000001 000000000000000000000000  1 7  9 
000001 1 1 0100000000001 1 000000000000000000000000001 00000000001 0000001 000  1 3  5 
00010000001 00000000000000000000001 00001 001000000000101 001 000001001 0000  1 9  6 
0000000000000110100000000000000100000000000000100010100000100110000000  18  8 
000000000000000000000010000001100000000001000001 110000110000000100001 1  20  4 
0001 1 001000000000000000001 00000000000000000100000100000001 000000000100  14  7 
00000000000001 1 00000000000000001000000000000001 000001 000001 0001 0000000  1 8  8 
01 00000000000000100000000010100000000001 1 000000000000000001 00000100000  16  1 
000001 0000001 001 0010000000000000001 01 1 00000001 000000000001000000000000  1 7  9 
00000000000000000100001 010000000001 0000001 0010101 00000001000000001 1001  15  10 
1 000000000000000000100000001 0000001 1 01 00001 000000000000000001 1 00000000  1 2  3 
0100101 00100000000001 1 001000000000000000000000001 00000000001 0000001 000  1 3  5 
0100000000000000100000000010100000000001110000000000000000000000100000  16  1 
00000000000001 1 000001 00000000001000000000000001 000001 000001 00010000000  1 8  8 
1000000000000000000000000001000001010110001000000000000000001100000000  12  3 
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Appendix  V.  Prototype  Graphical  User  Interface 


The  visual  basic  program  interface  allows  the  user  to  enter  symptoms  by  an  input  file,  or 
directly  on  the  screen  for  symptom  vectors  having  up  to  15  components.  It  displays  the 
proportion  of  input  assigned  to  each  class  as  the  height  of  bars.  The  number  15  was 
selected  to  be  large  enough  for  experimentation  and  demonstration  purposes  and  small 
enough  to  allow  an  uncrowded  screen. 
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